Store dist manifest in JSON to improve load performance #4344

Kobzol · 2025-05-22T07:05:40Z

Currently, the DIST_MANIFEST (multirust-channel-manifest) is being stored in TOML, which is not great for performance. The file has ~1 MiB on disk, and loading it takes ~24ms on my notebook. That's more than half the runtime of invoking rustc --version through rustup.

Since the manifest is generated and read only by rustup (AFAIK), I think that we could use a faster format to reduce this bottleneck. In this PR I chose to use JSON, for several reasons:

It is a very stable format that is unlikely to introduce breaking changes in the near future.
serde_json is well maintained and unlikely to be abandoned, and there are many other JSON (de)serialization alternatives.
JSON is more or less human readable if someone needed to inspect the file manually.
The JSON manifest is much faster to parse than TOML, ~1.7ms vs ~24ms on my machine.
serde_json can out of the box deal with the current Manifest format. Since it uses #[serde(from...]) and skip_serializing, this breaks other formats, such as bincode, rmp_serde (MsgPack) and Postcard. These would produce smaller files and even higher performance than JSON, but they would also require some changes to the Manifest structure.

The manifest is parsed on more places (e.g. make_component_unavailable), but I only modified the DIST_CHANNEL file, since that was the bottleneck that I saw.

Context: #2626

djc · 2025-05-22T07:32:46Z

Thanks for working on this!

Not sure whether this is intended as just sharing your experiment or whether you actually want to land this.

I feel like we should explore #2626 (comment) a bit before we invest time in this.

adamreichold · 2025-05-22T07:47:36Z

I feel like we should explore #2626 (comment) a bit before we invest time in this.

Especially since that reduced list of installed components might be more amenable to a smaller binary format like Postcard compared to the full manifest.

Kobzol · 2025-05-22T07:53:15Z

I mostly wanted to see how much code would have to be changed to use the JSON format, seems like not that much. So I don't think it would be so bad to land as-is.

Doing the other approach would likely require larger code changes, but it might be a better solution overall. I would have to first understand how do the manifests work to implement it though.

Kobzol · 2025-05-27T10:56:59Z

The structure of the code makes it a bit challenging to change this. Essentially, the code first loads the full manifest, then it goes through all its rust components for a given target, and for each of them it checks that it is installed. We would need to store some smaller submanifests split by target (one file per target), which would only contain the rust package to make rustup load less data.

However, while investigating this, I also noticed something interesting. In the "common case", where you just execute rustc without any rust-toolchain.toml override, the list of components and targets to be checked is actually empty. So in that case we load the manifest for no reason, we parse it, go through all the components, and then we do .all() on an empty iterator.

#4350 should fix this.

Kobzol · 2025-05-27T14:20:41Z

I think that this change is no longer worth it now that #4350 has been merged.

bjorn3 · 2025-05-27T14:57:55Z

Wouldn't this still be worth it for the people who do specify components or targets in rust-toolchain.toml? Those are pretty common in my experience.

Kobzol · 2025-05-27T15:02:15Z

It would definitely make the parsing faster for their use-cases. Not sure whether it is worth it to solve that with this specific approach, it seemed like it wasn't very popular with rustup maintainers. I'm of course willing to reopen and finish this if there's interset :)

djc · 2025-05-27T15:05:43Z

The structure of the code makes it a bit challenging to change this. Essentially, the code first loads the full manifest, then it goes through all its rust components for a given target, and for each of them it checks that it is installed. We would need to store some smaller submanifests split by target (one file per target), which would only contain the rust package to make rustup load less data.

Maybe it would make sense to store a "mini manifest" only for targets that are installed?

Kobzol · 2025-05-27T15:29:09Z

I guess it would be possible, yeah. And if the target file is not on disk, it would be considered to be missing. Note that I'm not sure how "mini" would the manifests be, it looked like the rust component, cat multirust-channel-manifest.toml | grep "pkg.rust\." | wc -l has 5757 entries for the stable toolchain (although many of these are duplicated for multiple targets).

Store dist manifest in JSON to improve load performance

9c912e4

Kobzol mentioned this pull request May 22, 2025

improve rustc wrapper startup time? #2626

Open

Kobzol mentioned this pull request May 27, 2025

Skip manifest loading if there are no components/targets to check #4350

Merged

Kobzol closed this May 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Store dist manifest in JSON to improve load performance #4344

Store dist manifest in JSON to improve load performance #4344

Uh oh!

Kobzol commented May 22, 2025 •

edited

Loading

Uh oh!

djc commented May 22, 2025 •

edited

Loading

Uh oh!

adamreichold commented May 22, 2025

Uh oh!

Kobzol commented May 22, 2025

Uh oh!

Kobzol commented May 27, 2025 •

edited

Loading

Uh oh!

Kobzol commented May 27, 2025

Uh oh!

bjorn3 commented May 27, 2025

Uh oh!

Kobzol commented May 27, 2025

Uh oh!

djc commented May 27, 2025

Uh oh!

Kobzol commented May 27, 2025

Uh oh!

Uh oh!

Store dist manifest in JSON to improve load performance #4344

Store dist manifest in JSON to improve load performance #4344

Uh oh!

Conversation

Kobzol commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

djc commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adamreichold commented May 22, 2025

Uh oh!

Kobzol commented May 22, 2025

Uh oh!

Kobzol commented May 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Kobzol commented May 27, 2025

Uh oh!

bjorn3 commented May 27, 2025

Uh oh!

Kobzol commented May 27, 2025

Uh oh!

djc commented May 27, 2025

Uh oh!

Kobzol commented May 27, 2025

Uh oh!

Uh oh!

Kobzol commented May 22, 2025 •

edited

Loading

djc commented May 22, 2025 •

edited

Loading

Kobzol commented May 27, 2025 •

edited

Loading